Frequency-based Rare Events Mining in Administrative Health Data

نویسندگان

  • Jie Chen
  • Huidong Jin
  • Hongxing He
  • Christine M. O’Keefe
  • Ross Sparks
  • Graham Williams
  • Damien McAullay
  • Chris Kelman
چکیده

The low occurrence rate of adverse drug reactions makes it difficult to identify risk factors from a straightforward application of association pattern discovery in large databases. In this paper, we are interested in developing a data mining approach that can use the information about rare events in sequence data in order to measure the multiple occurrences of patterns in the whole period of target and non-target data. To address this, we define an interestingness measure which exploits the difference between the frequency of patterns in target and non-target sequence data. The proposed approach guarantees the easy generation of candidate patterns from the target sequence data by applying existing association mining algorithms. These patterns can then be evaluated by comparing their frequency in the target and non-target data. We also propose a ranking algorithm that takes into account both the rank of the patterns as determined by the interestingness measure and their supports in the target population. This algorithm can prune the patterns greatly and highlight more interesting results. Experimental results of a case study on Angioedema show the usefulness of the proposed approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Frequency-Based Temporal Pattern Mining in Health Data

The low occurrence rate of adverse drug reactions makes it difficult to identify the risk factors from straightforward application of frequent pattern discovery in large databases. In this paper, we are interested in developing a data mining strategy that can fully utilize the information around rare events in sequence data in order to measure the multiple occurrences of patterns in the whole p...

متن کامل

Rare Event Analysis of High Dimensional Building Operational Data Using Data Mining Techniques

Today’s building automation systems (BASs) are becoming increasingly complex. A typical BAS usually stores hundreds of sensor measurements and control signals at each time step, which produces massive high dimensional data sets. Traditional analysis methods for BAS data only focus on a small subset of the data, resulting in a huge information loss. Data mining techniques are more effective in k...

متن کامل

A Framework of Process Mining for RFID Event Analysis

As information systems and telecommunication devices are spread, many organizations accumulate a lot of events which are generated in performing business activities. The analysis of real-time data and events can play a critical role in implementing real-time enterprises and business intelligence. Recently, supply chain and manufacturing sectors have adopted ubiquitous environment that generate ...

متن کامل

Identifying fall-related injuries: Text mining the electronic medical record

Unintentional injury due to falls is a serious and expensive health problem among the elderly. This is especially true in the Veterans Health Administration (VHA) ambulatory care setting, where nearly 40% of the male patients are 65 or older and at risk for falls. Health service researchers and clinicians can utilize VHA administrative data to identify and explore the frequency and nature of fa...

متن کامل

Detection of adverse drug events: proposal of a data model.

Our main objective is to detect adverse drug events (ADEs) in former hospital stays. As ADEs are rare, that supposes to screen thousands of electronic health records (EHRs). For that purpose, we need to define a data model that has two main objectives: (1) being able to describe hospital stays from various hospitals (2) being tuned so as to prepare the data mining process: as ADEs are not flagg...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006